NSF PAR Search | NSF Public Access Repository

Integrating biological and environmental data to solve key scientific and societal challenges

https://doi.org/10.1093/biosci/biaf150

Kunkel, David M; Long-Fox, Brooke L; Pittman, Cameron; Portmann, Julia; Sheik, Matthew; Bates, John M; Bentley, Andrew; Contreras, Dori L; Ellwood, Elizabeth R; Lomas, Michael W; et al (October 2025, BioScience)

Abstract Biodiversity collections in the United States hold over a billion specimens and are essential to understanding the history of life on Earth, as well as patterns of biodiversity in response to environmental change. Each specimen is linked by metadata to an organism's name and the place and time of its collection. Extensive data have been collected on Earth's geology, hydrology, climate, and organisms—past and present—but the data remain largely fragmented. We report in the present article on community discussions to develop a roadmap and identify action items for the Building an Integrated, Open, Findable, Accessible, Interoperable, and Reusable (BIOFAIR) Data Network, directly linking the various types of biological and environmental data. The roadmap is organized into five themes: stocktaking and gap analysis, technological capacity building, best practices, education and training, and community building. Together, these themes chart a path from initial resource inventories and skill building to infrastructure development, cross‑disciplinary collaboration, and the establishment of FAIR‑compliant workflows and governance.
Community Action: Planning for Specimen Management in Funding Proposals

https://doi.org/10.1093/biosci/biae032

Bentley, Andrew; Thiers, Barbara; Moser, William E; Watkins-Colwell, Gregory J; Zimkus, Breda M; Monfils, Anna K; Franz, Nico M; Bates, John M; Boundy-Mills, Kyria; Lomas, Michael W; et al (June 2024, BioScience)

Connecting the Dots: Aligning human capacity through networks toward a globally interoperable Digital Extended Specimen (DES) infrastructure

https://doi.org/10.3897/biss.7.112390

Ellwood, Elizabeth R.; Addink, Wouter; Bates, John; Bentley, Andrew; Buschbom, Jutta; Freire-Fierro, Alina; Fortes, Jose; Jennings, David; Lehnert, Kerstin; Ludäscher, Bertram; et al (September 2023, Biodiversity Information Science and Standards)

Thanks to substantial support for biodiversity data mobilization in recent decades, billions of occurrence records are openly available, documenting life on Earth and enabling timely research, awareness raising, and policy-making. Initiatives across local to global scales have been separately funded to serve different, yet often overlapping audiences of data users, and have developed a variety of platforms and infrastructures to meet the needs of these audiences. The independent progress of biodiversity data providers has led to innovations as well as challenges for the community at large as we move towards connecting and linking a diversity of information from disparate sources as Digital Extended Specimens (DES). Recognizing a need for deeper and more frequent opportunities for communication and collaboration across the globe, an ad hoc group of representatives of various international, national, and regional organizations has been meeting virtually since 2020 to provide a forum for updates, announcements, and shared progress. This group is provisionally named International Partners for the Digital Extended Specimen (IPDES), and is guided by these four concepts: Biodiversity, Connection, Knowledge and Agency. Participants in IPDES include representatives of the Global Biodiversity Information Facility (GBIF), Integrated Digitized Biocollections (iDigBio), American Institute of Biological Sciences (AIBS), Biodiversity Collections Network (BCoN), Natural Science Collections Alliance (NSCA), Distributed System of Scientific Collections (DiSSCo), Atlas of Living Australia (ALA), Biodiversity Information Standards (TDWG), Society for the Preservation of Natural History Collections (SPNHC), National Specimen Information Infrastructure of China (NSII), and South African National Biodiversity Institute (SANBI), as well as individuals involved with biodiversity informatics initiatives, natural science collections, museums, herbaria, and universities.
Our global partners group strives to increase representation from around the globe as we aim to enable research that contributes to novel discoveries and addresses the societal challenges leading to the biodiversity crisis. Our overarching mission is to expand on the community-driven successes to connect biodiversity data and knowledge through coordination of a globally integrated network of stakeholders to enable an extensible technical and social infrastructure of data, tools, and working practices in support of our vision. The main work of our group thus far includes publishing a paper on the Digital Extended Specimen (Hardisty et al. 2022), organizing and hosting an array of activities at conferences, and asynchronous online work and forum-based exchanges. We aim to advance discussion on topics of broad interest to our community such as social and technical capacity building, broadening participation, expanding social and data networks, improving data models and building a backbone for the DES, and identifying international funding solutions. This presentation will highlight some of these activities and detail progress towards a roadmap for the development of the human network and technical infrastructure necessary to support the DES. It provides an opportunity for feedback from and engagement by stakeholder communities such as TDWG and other initiatives with a focus on data standards and biodiversity informatics, as we solidify our plans for the future in support of integrated and interconnected biodiversity data and credit for those doing the work.
Highlights and Outcomes of the 2021 Global Community Consultation

https://doi.org/10.3897/biss.5.72716

Ellwood, Elizabeth R.; Bentley, Andrew; Buschbom, Jutta; Hardisty, Alex; Mast, Austin; Miller, Joe; Monfils, Anna; Nelson, Gil; Paul, Deborah L (August 2021, Biodiversity Information Science and Standards)

International collaboration between collections, aggregators, and researchers within the biodiversity community and beyond is becoming increasingly important in our efforts to support biodiversity, conservation and the life of the planet. The social, technical, logistical and financial aspects of an equitable biodiversity data landscape – from workforce training and mobilization of linked specimen data, to data integration, use and publication – must be considered globally and within the context of a growing biodiversity crisis. In recent years, several initiatives have outlined paths forward that describe how digital versions of natural history specimens can be extended and linked with associated data. In the United States, Webster (2017) presented the “extended specimen”, which was expanded upon by Lendemer et al. (2019) through the work of the Biodiversity Collections Network (BCoN). At the same time, a “digital specimen” concept was developed by DiSSCo in Europe (Hardisty 2020). Both the extended and digital specimen concepts depict a digital proxy of an analog natural history specimen, whose digital nature provides greater capabilities, such as machine processability; linkages with associated data; globally accessible, information-rich biodiversity data; improved tracking, attribution and annotation; additional opportunities for data use and cross-disciplinary collaborations forming the basis for FAIR (Findable, Accessible, Interoperable, Reusable) and equitable sharing of benefits worldwide; and innumerable other advantages, with slight variation in how an extended or digital specimen model would be executed. Recognizing the need to align the two closely related concepts, and to provide a place for open discussion around various topics of the Digital Extended Specimen (DES; the current working name for the joined concepts), we initiated a virtual consultation on the discourse platform hosted by the Alliance for Biodiversity Knowledge through GBIF.
This platform provided a forum for threaded discussions around topics related and relevant to the DES. The goals of the consultation align with the goals of the Alliance for Biodiversity Knowledge: expand participation in the process, build support for further collaboration, identify use cases, identify significant challenges and obstacles, and develop a comprehensive roadmap towards achieving the vision for a global specification for data integration. In early 2021, Phase 1 launched with five topics: Making FAIR data for specimens accessible; Extending, enriching and integrating data; Annotating specimens and other data; Data attribution; and Analyzing/mining specimen data for novel applications. This round of full discussion was productive and engaged dozens of contributors, with hundreds of posts and thousands of views. During Phase 1, several deeper, more technical, or additional topics of relevance were identified and formed the foundation for Phase 2 which began in May 2021 with the following topics: Robust access points and data infrastructure alignment; Persistent identifier (PID) scheme(s); Meeting legal/regulatory, ethical and sensitive data obligations; Workforce capacity development and inclusivity; Transactional mechanisms and provenance; and Partnerships to collaborate more effectively. In Phase 2 fruitful progress was made towards solutions to some of these complex functional and technical long-term goals. Simultaneously, our commitment to open participation was reinforced, through increased efforts to involve new voices from allied and complementary fields. 
Among a wealth of ideas expressed, the community highlighted the need for unambiguous persistent identifiers and a dedicated agent to assign them; support for a fully linked system that includes robust publishing mechanisms; strong support for social structures that build trustworthiness of the system; appropriate attribution of legacy and new work; a system that is inclusive, removed from colonial practices, and supportive of creative use of biodiversity data; a truly global data infrastructure; a balance between open access and legal obligations and ethical responsibilities; and the partnerships necessary for success. These two consultation periods, and the myriad activities surrounding the online discussion, produced a wide variety of perspectives, strategies, and approaches to converging the digital and extended specimen concepts and progressing plans for the DES, steps necessary to improve access to research-ready data to advance our understanding of the diversity and distribution of life. Discussions continue and we hope to include your contributions to the DES in future implementation plans.
Digital Extended Specimens: Enabling an Extensible Network of Biodiversity Data Records as Integrated Digital Objects on the Internet

https://doi.org/10.1093/biosci/biac060

Hardisty, Alex R; Ellwood, Elizabeth R; Nelson, Gil; Zimkus, Breda; Buschbom, Jutta; Addink, Wouter; Rabeler, Richard K; Bates, John; Bentley, Andrew; Fortes, José A; et al (August 2022, BioScience)

Abstract The early twenty-first century has witnessed massive expansions in availability and accessibility of digital data in virtually all domains of the biodiversity sciences. Led by an array of asynchronous digitization activities spanning ecological, environmental, climatological, and biological collections data, these initiatives have resulted in a plethora of mostly disconnected and siloed data, leaving to researchers the tedious and time-consuming manual task of finding and connecting them in usable ways, integrating them into coherent data sets, and making them interoperable. The focus to date has been on elevating analog and physical records to digital replicas in local databases prior to elevating them to ever-growing aggregations of essentially disconnected discipline-specific information. In the present article, we propose a new interconnected network of digital objects on the Internet—the Digital Extended Specimen (DES) network—that transcends existing aggregator technology, augments the DES with third-party data through machine algorithms, and provides a platform for more efficient research and robust interdisciplinary discovery.
Bridging the Research Gap between Live Collections in Zoos and Preserved Collections in Natural History Museums

https://doi.org/10.1093/biosci/biac022

Poo, Sinlan; Whitfield, Steven M; Shepack, Alexander; Watkins-Colwell, Gregory J; Nelson, Gil; Goodwin, Jillian; Bogisich, Allison; Brennan, Patricia L; D'Agostino, Jennifer; Koo, Michelle S; et al (April 2022, BioScience)

Abstract Zoos and natural history museums are both collections-based institutions with important missions in biodiversity research and education. Animals in zoos are a repository and living record of the world's biodiversity, whereas natural history museums are a permanent historical record of snapshots of biodiversity in time. Surprisingly, despite significant overlap in institutional missions, formal partnerships between these institution types are infrequent. Life history information, pedigrees, and medical records maintained at zoos should be seen as complementary to historical records of morphology, genetics, and distribution kept at museums. Through examining both institution types, we synthesize the benefits and challenges of cross-institutional exchanges and propose actions to increase the dialog between zoos and museums. With a growing recognition of the importance of collections to the advancement of scientific research and discovery, a transformational impact could be made with long-term investments in connecting the institutions that are caretakers of living and preserved animals.
The Extended Specimen Network: A Strategy to Enhance US Biodiversity Collections, Promote Research and Education

https://doi.org/10.1093/biosci/biz140

Lendemer, James; Thiers, Barbara; Monfils, Anna K; Zaspel, Jennifer; Ellwood, Elizabeth R; Bentley, Andrew; LeVan, Katherine; Bates, John; Jennings, David; Contreras, Dori; et al (November 2019, BioScience)

Importance of timely metadata curation to the global surveillance of genetic diversity

https://doi.org/10.1111/cobi.14061

Crandall, Eric D; Toczydlowski, Rachel H; Liggins, Libby; Holmes, Ann E; Ghoojaei, Maryam; Gaither, Michelle R; Wham, Briana E; Pritt, Andrea L; Noble, Cory; Anderson, Tanner J; et al (March 2023, Conservation Biology)

Abstract Genetic diversity within species represents a fundamental yet underappreciated level of biodiversity. Because genetic diversity can indicate species resilience to changing climate, its measurement is relevant to many national and global conservation policy targets. Many studies produce large amounts of genome‐scale genetic diversity data for wild populations, but most (87%) do not include the associated spatial and temporal metadata necessary for them to be reused in monitoring programs or for acknowledging the sovereignty of nations or Indigenous peoples. We undertook a distributed datathon to quantify the availability of these missing metadata and to test the hypothesis that their availability decays with time. We also worked to remediate missing metadata by extracting them from associated published papers, online repositories, and direct communication with authors. Starting with 848 candidate genomic data sets (reduced representation and whole genome) from the International Nucleotide Sequence Database Collaboration, we determined that 561 contained mostly samples from wild populations. We successfully restored spatiotemporal metadata for 78% of these 561 data sets (n = 440 data sets with data on 45,105 individuals from 762 species in 17 phyla). Examining papers and online repositories was much more fruitful than contacting 351 authors, who replied to our email requests 45% of the time. Overall, 23% of our email queries to authors unearthed useful metadata. The probability of retrieving spatiotemporal metadata declined significantly as age of the data set increased. There was a 13.5% yearly decrease in metadata associated with published papers or online repositories and up to a 22% yearly decrease in metadata that were only available from authors. 
This rapid decay in metadata availability, mirrored in studies of other types of biological data, should motivate swift updates to data‐sharing policies and researcher practices to ensure that the valuable context provided by metadata is not lost to conservation science forever.